NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Systematic Study of Popular Software Packages and AI/ML Models for Calibrating In Situ Air Quality Data: An Example with Purple Air Sensors

https://doi.org/10.3390/s25041028

Smith, Seren; Trefonides, Theodore; Srirenganathan_Malarvizhi, Anusha; LaGarde, Shyra; Liu, Jiakang; Jia, Xiaoguo; Wang, Zifu; Cain, Jacob; Huang, Thomas; Pourhomayoun, Mohammad; et al (February 2025, Sensors)

Accurate air pollution monitoring is critical to understand and mitigate the impacts of air pollution on human health and ecosystems. Due to the limited number and geographical coverage of advanced, highly accurate sensors monitoring air pollutants, many low-cost and low-accuracy sensors have been deployed. Calibrating low-cost sensors is essential to fill the geographical gap in sensor coverage. We systematically examined how different machine learning (ML) models and open-source packages could help improve the accuracy of particulate matter (PM) 2.5 data collected by Purple Air sensors. Eleven ML models and five packages were examined. This systematic study found that both models and packages impacted accuracy, while the random training/testing split ratio (e.g., 80/20 vs. 70/30) had minimal impact (0.745% difference for R2). Long Short-Term Memory (LSTM) models trained in RStudio and TensorFlow excelled, with high R2 scores of 0.856 and 0.857 and low Root Mean Squared Errors (RMSEs) of 4.25 µg/m3 and 4.26 µg/m3, respectively. However, LSTM models may be too slow (1.5 h) or computation-intensive for applications with fast response requirements. Tree-boosted models including XGBoost (0.7612, 5.377 µg/m3) in RStudio and Random Forest (RF) (0.7632, 5.366 µg/m3) in TensorFlow offered good performance with shorter training times (<1 min) and may be suitable for such applications. These findings suggest that AI/ML models, particularly LSTM models, can effectively calibrate low-cost sensors to produce precise, localized air quality data. This research is among the most comprehensive studies on AI/ML for air pollutant calibration. We also discussed limitations, applicability to other sensors, and the explanations for good model performances. This research can be adapted to enhance air quality monitoring for public health risk assessments, support broader environmental health initiatives, and inform policy decisions.
more » « less
Full Text Available
Lifting, Loading, and Buckling in Conical Shells

https://doi.org/10.1103/PhysRevLett.131.148202

Duffy, Daniel; McCracken, Joselle M; Hebner, Tayler S; White, Timothy J; Biggins, John S (October 2023, Physical Review Letters)

Full Text Available
Adopting GPU computing to support DL-based Earth science applications

https://doi.org/10.1080/17538947.2023.2233488

Wang, Zifu; Li, Yun; Wang, Kevin; Cain, Jacob; Salami, Mary; Duffy, Daniel Q.; Little, Michael M.; Yang, Chaowei (October 2023, International Journal of Digital Earth)

Full Text Available
Cross-track infrared sounder cloud fraction retrieval using a deep neural network

https://doi.org/10.1016/j.cageo.2022.105268

Liu, Qian; Xu, Hui; Houser, Paul R.; Sun, Donglian; Rice, Matthew; Wang, Likun; Duffy, Daniel Q.; Yang, Chaowei (January 2023, Computers & Geosciences)

Full Text Available
Hyperspectral Infrared Sounder Cloud Detection Using Deep Neural Network Model

https://doi.org/10.1109/LGRS.2020.3023683

Liu, Qian; Xu, Hui; Sha, Dexuan; Lee, Tsengdar; Duffy, Daniel Q.; Walter, Jeff; Yang, Chaowei (September 2020, IEEE Geoscience and Remote Sensing Letters)
null (Ed.)
Full Text Available
Spatiotemporal changes in global nitrogen dioxide emission due to COVID-19 mitigation policies

https://doi.org/10.1016/j.scitotenv.2021.146027

Liu, Qian; Malarvizhi, Anusha Srirenganathan; Liu, Wei; Xu, Hui; Harris, Jackson T.; Yang, Jingchao; Duffy, Daniel Q.; Little, Michael M.; Sha, Dexuan; Lan, Hai; et al (July 2021, Science of The Total Environment)
null (Ed.)
Full Text Available
Spatiotemporal impacts of COVID-19 on air pollution in California, USA

https://doi.org/10.1016/j.scitotenv.2020.141592

Liu, Qian; Harris, Jackson T.; Chiu, Long S.; Sun, Donglian; Houser, Paul R.; Yu, Manzhu; Duffy, Daniel Q.; Little, Michael M.; Yang, Chaowei (January 2021, Science of The Total Environment)
null (Ed.)
Full Text Available
Spatiotemporal event detection: a review

https://doi.org/10.1080/17538947.2020.1738569

Yu, Manzhu; Bambacus, Myra; Cervone, Guido; Clarke, Keith; Duffy, Daniel; Huang, Qunying; Li, Jing; Li, Wenwen; Li, Zhenlong; Liu, Qian; et al (December 2020, International Journal of Digital Earth)
null (Ed.)
Full Text Available
A hierarchical indexing strategy for optimizing Apache Spark with HDFS to efficiently query big geospatial raster data

https://doi.org/10.1080/17538947.2018.1523957

Hu, Fei; Yang, Chaowei; Jiang, Yongyao; Li, Yun; Song, Weiwei; Duffy, Daniel Q.; Schnase, John L.; Lee, Tsengdar (March 2020, International Journal of Digital Earth)
null (Ed.)
Full Text Available
PreciPatch: A Dictionary-based Precipitation Downscaling Method

https://doi.org/10.3390/rs12061030

Xu, Mengchao; Liu, Qian; Sha, Dexuan; Yu, Manzhu; Duffy, Daniel; Putman, William; Carroll, Mark; Lee, Tsengdar; Yang, Chaowei (March 2020, Remote Sensing)

Climate and weather data such as precipitation derived from Global Climate Models (GCMs) and satellite observations are essential for the global and local hydrological assessment. However, most climatic popular precipitation products (with spatial resolutions coarser than 10km) are too coarse for local impact studies and require “downscaling” to obtain higher resolutions. Traditional precipitation downscaling methods such as statistical and dynamic downscaling require an input of additional meteorological variables, and very few are applicable for downscaling hourly precipitation for higher spatial resolution. Based on dynamic dictionary learning, we propose a new downscaling method, PreciPatch, to address this challenge by producing spatially distributed higher resolution precipitation fields with only precipitation input from GCMs at hourly temporal resolution and a large geographical extent. Using aggregated Integrated Multi-satellitE Retrievals for GPM (IMERG) data, an experiment was conducted to evaluate the performance of PreciPatch, in comparison with bicubic interpolation using RainFARM—a stochastic downscaling method, and DeepSD—a Super-Resolution Convolutional Neural Network (SRCNN) based downscaling method. PreciPatch demonstrates better performance than other methods for downscaling short-duration precipitation events (used historical data from 2014 to 2017 as the training set to estimate high-resolution hourly events in 2018).
more » « less
Full Text Available

« Prev Next »

Search for: All records